WO 03/077838



SEQUENCE LISTING 

SEQ ID NO:1 full length nucleic acid 89.6 gp160 Env sequence

atgagagt gaaggagatc aggaagaatt ggcagcactt gagagggggc 
atcttgctcc ttgggatgtt gatgatctgt agtgctgcaa aagaaaaaac 
gtgggtcaca atctattatg gggtacctgt gtggagagaa gcaaccacca 
ctctattttg tgcatcagat gctaaagcct atgatacaga ggtacataat 
gtttgggcca cacatgcctg tgtacccaca gaccccaacc cacaagaagt 
agtattggga aatgtgacag aaaattttaa catgtggaaa aataacatgg 
tagatcagat gcatgaggat ataatcagtt tatgggatga aagcctaaag 
ccatgtgtaa aattaacccc actctgtgtt actttaaatt gcactaattt 
gaatatcact aagaatacta ctaatcccac tagtagcagc tggggaatga 
tggagaaagg agaaataaaa aattgctctt tctatatcac cacaagcata 
agaaataagg taaagaaaga atatgcactt tttaatagac ttgatgtagt 
accaatagaa aatactaata atactaagta taggttaata agttgtaaca 
cctcagtcat tacacaggcc tgtccaaagg tatcctttca gccaattccc 
atacattatt gtgtcccggc tgggtttgcg atgctaaagt gtaacaataa 
gacattcaat ggatcaggac catgcacaaa tgtcagcaca gtacaatgta
cacatggaat taggccagtg gtgtcaactc aactgctgtt aaatggcagt 
ctagcagaag aagacatagt aattagatct gaaaatttca cagacaatgc 
taaaaccata atagtacagc taaatgaatc tgtagtaatt aattgtacaa 
gacccaacaa caatacaaga agaaggttat ctataggacc agggagagca 
ttttatgcaa gaagaaacat aataggagat ataagacaag cacattgtaa 
cattagtaga gcaaaatgga ataacacttt acaacagata gttataaaat 
taagagaaaa atttaggaat aaaacaatag cctttaatca atcctcagga 
ggggacccag aaattgtaat gcacagtttt aattgtggag gggaattttt 
ctactgtaat acagcacaac tgtttaatag tacttggaat gttactggag 
ggacaaatgg cactgaagga aatgacataa tcacactcca atgcagaata 
aaacaaatta taaatatgtg gcagaaagta ggaaaagcaa tgtatgcccc 
tcccatcaca ggacaaatta gatgttcatc aaatattaca gggctgctac 
taacaagaga tggaggtaat agtactgaga ctgagactga gatcttcaga 
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata 
taaagtagta agaattgaac caataggagt agcacccacc agggcaaaga 
gaagaacagt gcaaagagaa aaaagagcag tgggaatagg agctgtgttc 
cttgggttct tgggagcagc aggaagcact atgggcgcag cgtcagtgac 
gctgacggta caggccaggc tattattgtc tggtatagtg cagcagcaga 
acaatctgct gagggctatt gaggcgcaac agcatatgtt gcaactcaca 
gtctggggca tcaagcagct ccaggcaaga gtcctggctc tggaaagata 
cctaagggat caacagctca tgggaatttg gggttgctct ggaaaactca 
tttgcaccac ttctgtgcct tggaatgtta gttggagtaa taaatctgtg 
gatgatattt ggaataacat gacctggatg gagtgggaaa gagaaattga 
caattacaca gactatatat atgacttact tgaaaaatcg caaacccaac 
aagaaaagaa tgaaaaagaa ttattggaat tggataaatg ggcaagtttg 
tggaattggt ttgacataac aaactggctg tggtatataa gattattcat 
aatgatagta ggaggcttga taggtttaag aatagttttt gctgtacttt 
ctatagtaaa tagagttagg cagggatatt caccattatc gtttcagacc
ctcctcccag cctcgagggg acccgacagg cccgaaggaa cagaagaaga
aggtggagag agagacagag acagatccgg tccattagtg aacggattct
tggcactttt ctgggtcgat ttgaggaacc tgtgcctctt cctctaccac
ctcttgagaa acttactctt gattgtaacg aggattgtgg aacttctggg
acgcaggggg tgggaagccc tcaaatattg gtggaatctc ctgcaatatt
ggagccagga actaaagaat agtgctgtta gcttgctcaa cgccacagcc
atagcagtag ctgaggggac agatagggtt ataaaaatag tacaaagagc
ttgtagagct attcgcaaca tacctacaag aatcagacag ggcttggaaa
gggctttgct ataa



SEQ ID NO:2 Amino acid sequence of 89.6 gp160

1 MRVKEIRKNW QHLRGGILLL GMLMICSAAK EKTWVTIYYG VPVWREATTT

51 LFCASDAKAY DTEVHNVWAT HACVPTDPNP QEVVLGNVTE NFNMWKNNMV

101 DQMHEDIISL WDESLKPCVK LTPLCVTLNC TNLNITKNTT NPTSSSWGMM

151 EKGEIKNCSF YITTSIRNKV KKEYALFNRL DVVPIENTNN TKYRLISCNT

201 SVITQACPKV SFQPIPIHYC VPAGFAMLKC NNKTFNGSGP CTNVSTVQCT

251 HGIRPVVSTQ LLLNGSLAEE DIVIRSENFT DNAKTIIVQL NESVVINCTR

301 PNNNTRRRLS IGPGRAFYAR RNIIGDIRQA HCNISRAKWN NTLQQIVIKL

351 REKFRNKTIA FNQSSGGDPE IVMHSFNCGG EFFYCNTAQL FNSTWNVTGG

401 TNGTEGNDII TLQCRIKQII NMWQKVGKAM YAPPITGQIR CSSNITGLLL

451 TRDGGNSTET ETEIFRPGGG DMRDNWRSEL YKYKVVRIEP IGVAPTRAKR

501 RTVQREKRAV GIGAVFLGFL GAAGSTMGAA SVTLTVQARL LLSGIVQQQN

551 NLLRAIEAQQ HMLQLTVWGI KQLQARVLAL ERYLRDQQLM GIWGCSGKLI

601 CTTSVPWNVS WSNKSVDDIW NNMTWMEWER EIDNYTDYIY DLLEKSQTQQ

651 EKNEKELLEL DKWASLWNWF DITNWLWYIR LFIMIVGGLI GLRIVFAVLS

701 IVNRVRQGYS PLSFQTLLPA SRGPDRPEGT EEEGGERDRD RSGPLVNGFL

751 ALFWVDLRNL CLFLYHLLRN LLLIVTRIVE LLGRRGWEAL KYWWNLLQYW

801 SQELKNSAVS LLNATAIAVA EGTDRVIKIV QRACRAIRNI PTRIRQGLER

851 ALL*



SEQ ID NO:3 89.6 gp140 Nucleic Acid sequence

atgagagt gaaggagatc aggaagaatt ggcagcactt gagagggggc 
atcttgctcc ttgggatgtt gatgatctgt agtgctgcaa aagaaaaaac 
gtgggtcaca atctattatg gggtacctgt gtggagagaa gcaaccacca 
ctctattttg tgcatcagat gctaaagcct atgatacaga ggtacataat 
gtttgggcca cacatgcctg tgtacccaca gaccccaacc cacaagaagt 
agtattggga aatgtgacag aaaattttaa catgtggaaa aataacatgg 
tagatcagat gcatgaggat ataatcagtt tatgggatga aagcctaaag 
ccatgtgtaa aattaacccc actctgtgtt actttaaatt gcactaattt 
gaatatcact aagaatacta ctaatcccac tagtagcagc tggggaatga 
tggagaaagg agaaataaaa aattgctctt tctatatcac cacaagcata 
agaaataagg taaagaaaga atatgcactt tttaatagac ttgatgtagt 
accaatagaa aatactaata atactaagta taggttaata agttgtaaca 
cctcagtcat tacacaggcc tgtccaaagg tatcctttca gccaattccc 
atacattatt gtgtcccggc tgggtttgcg atgctaaagt gtaacaataa 
gacattcaat ggatcaggac catgcacaaa tgtcagcaca gtacaatgta 
cacatggaat taggccagtg gtgtcaactc aactgctgtt aaatggcagt 
ctagcagaag aagacatagt aattagatct gaaaatttca cagacaatgc 
taaaaccata atagtacagc taaatgaatc tgtagtaatt aattgtacaa
gacccaacaa caatacaaga agaaggttat ctataggacc agggagagca
ttttatgcaa gaagaaacat aataggagat ataagacaag cacattgtaa
cattagtaga gcaaaatgga ataacacttt acaacagata gttataaaat
taagagaaaa atttaggaat aaaacaatag cctttaatca atcctcagga
ggggacccag aaattgtaat gcacagtttt aattgtggag gggaattttt
ctactgtaat acagcacaac tgtttaatag tacttggaat gttactggag 
ggacaaatgg cactgaagga aatgacataa tcacactcca atgcagaata 
aaacaaatta taaatatgtg gcagaaagta ggaaaagcaa tgtatgcccc 
tcccatcaca ggacaaatta gatgttcatc aaatattaca gggctgctac 
taacaagaga tggaggtaat agtactgaga ctgagactga gatcttcaga 
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata 
taaagtagta agaattgaac caataggagt agcacccacc agggcaaaga 
gaagaacagt gcaaagagaa aaaagagcag tgggaatagg agctgtgttc 
cttgggttct tgggagcagc aggaagcact atgggcgcag cgtcagtgac 
gctgacggta caggccaggc tattattgtc tggtatagtg cagcagcaga
acaatctgct gagggctatt gaggcgcaac agcatatgtt gcaactcaca 
gtctggggca tcaagcagct ccaggcaaga gtcctggctc tggaaagata 
cctaagggat caacagctca tgggaatttg gggttgctct ggaaaactca 
tttgcaccac ttctgtgcct tggaatgtta gttggagtaa taaatctgtg 
gatgatattt ggaataacat gacctggatg gagtgggaaa gagaaattga 
caattacaca gactatatat atgacttact tgaaaaatcg caaacccaac 
aagaaaagaa tgaaaaagaa ttattggaat tggataaatg ggcaagtttg 
tggaattggt ttgacataac aaactggctg tggtatataa gattattcat 
aat 



SEQ ID NO:4 Amino acid sequence of 89.6 gp140

1 MRVKEIRKNW QHLRGGILLL GMLMICSAAK EKTWVTIYYG VPVWREATTT

51 LFCASDAKAY DTEVHNVWAT HACVPTDPNP QEVVLGNVTE NFNMWKNNMV

101 DQMHEDIISL WDESLKPCVK LTPLCVTLNC TNLNITKNTT NPTSSSWGMM

151 EKGEIKNCSF YITTSIRNKV KKEYALFNRL DVVPIENTNN TKYRLISCNT

201 SVITQACPKV SFQPIPIHYC VPAGFAMLKC NNKTFNGSGP CTNVSTVQCT

251 HGIRPVVSTQ LLLNGSLAEE DIVIRSENFT DNAKTIIVQL NESVVINCTR

301 PNNNTRRRLS IGPGRAFYAR RNIIGDIRQA HCNISRAKWN NTLQQIVIKL

351 REKFRNKTIA FNQSSGGDPE IVMHSFNCGG EFFYCNTAQL FNSTWNVTGG

401 TNGTEGNDII TLQCRIKQII NMWQKVGKAM YAPPITGQIR CSSNITGLLL

451 TRDGGNSTET ETEIFRPGGG DMRDNWRSEL YKYKVVRIEP IGVAPTRAKR

501 RTVQREKRAV GIGAVFLGFL GAAGSTMGAA SVTLTVQARL LLSGIVQQQN

551 NLLRAIEAQQ HMLQLTVWGI KQLQARVLAL ERYLRDQQLM GIWGCSGKLI

601 CTTSVPWNVS WSNKSVDDIW NNMTWMEWER EIDNYTDYIY DLLEKSQTQQ

651 EKNEKELLEL DKWASLW*



SEQ ID NO:5: Nucleic acid sequence of env 89.6 indicating cleavage site and start of 
the gp41 region. 

7747 

CAA AGA GAA AAA AGA GCA GTG GGA ATA

SEQ ID NO:6 Amino acid sequence of env 89.6 indicating cleavage site and start of the
gp41 region.

REKRAVGI 
505 506 507 508 509 
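
The correspondence between the nucleic acid fragment of SEQ ID NO:5 and the peptide of SEQ ID NO:6 can be checked with a short translation sketch. This is only an illustration, not part of the listing: it assumes the standard genetic code and assumes the final codon of the fragment reads ATA (as in the full-length env sequence); the names `CODON_TABLE` and `translate` are hypothetical. The table covers only the codons that occur in this fragment.

```python
# Partial standard genetic code: only the codons present in the fragment.
CODON_TABLE = {
    "CAA": "Q", "AGA": "R", "GAA": "E", "AAA": "K",
    "GCA": "A", "GTG": "V", "GGA": "G", "ATA": "I",
}


def translate(fragment: str) -> str:
    """Translate a whitespace-separated run of codons into one-letter residues."""
    codons = fragment.upper().split()
    return "".join(CODON_TABLE[codon] for codon in codons)


# Fragment of SEQ ID NO:5; the last codon is assumed to be ATA.
fragment = "CAA AGA GAA AAA AGA GCA GTG GGA ATA"
peptide = translate(fragment)
print(peptide)
```

Under these assumptions the fragment translates to a leading glutamine followed by the eight residues listed in SEQ ID NO:6, with the REKR motif at positions 505 to 508 and the alanine at 509 marking the start of the gp41 region.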



SEQ ID NO:7 Amino acid sequence of gp140-26 (soluble, secreted protein)

1 KEKTWVTIYY GVPVWREATT TLFCASDAKA YDTEVHNGWA THACVATDPN

51 PQEVVLGNVT ENFNMWKNNM VDQMHEDIIS LLDESLKPCV KLTPLCVTLN

101 CTNLNITKNT TNPTSSSLGM MEKGEIKNCS FYITTSIRNK VKKEYALFNR

151 LDVVPIENTN NTKYRLISCN TSVITQACPK VFFQPIAIHY CVPAGFAMLK

201 CNNKTFNGSG PCTNVSTVPC THGIRPVVST QLLLNGSLAE EDIVIRSGNF

251 TDNAKTIIVQ LNESVVINCT RPNNNTRRRL SIGPGRAFYA RRNIIGDIRQ

301 AHCNISRAKL NNTLQQIVIK LREKFRNKTI AFNQSSGGDP EIVMHSFNCG

351 GEFFYCNTAQ LFNSTLNVTG GTNGTEENDI ITLQCRIKQI INMWQKVGKA

401 MYAPPITGQI ICSSNITGLL LTRDGGNSTE TETEIFRPGG GDMRDNWRSE

451 LYKYKVVRIE PIGVAPTRAK RRTCQGGIDG ILQISGSGSG GSGQGSSSGG

501            AVFLGFLGAA GSTMGARSVT LTVQARLLLS GIVQQQNNLL

551 RAIEAQQHML QLTVWGIKQL QARVLALERY LRDQQLMGIW GCSGKLICTT

601 SVPWNVSWSN KSVDDIWNNM TWMELEREID NYTDYIYDLL EKSQTQQEKN

651 EKELLELDKW ASLWKLV



SEQ ID NO:8 Amino acid sequence of gp140-15 (soluble, secreted protein)

1 KEKTWVTIYY GVPVWREATT TLFCASDAKA YDTEVHNGWA THACVATDPN

51 PQEVVLGNVT ENFNMWKNNM VDQMHEDIIS LLDESLKPCV KLTPLCVTLN

101 CTNLNITKNT TNPTSSSLGM MEKGEIKNCS FYITTSIRNK VKKEYALFNR

151 LDVVPIENTN NTKYRLISCN TSVITQACPK VFFQPIAIHY CVPAGFAMLK

201 CNNKTFNGSG PCTNVSTVPC THGIRPVVST QLLLNGSLAE EDIVIRSGNF

251 TDNAKTIIVQ LNESVVINCT RPNNNTRRRL SIGPGRAFYA RRNIIGDIRQ

301 AHCNISRAKL NNTLQQIVIK LREKFRNKTI AFNQSSGGDP EIVMHSFNCG

351 GEFFYCNTAQ LFNSTLNVTG GTNGTEENDI ITLQCRIKQI INMWQKVGKA

401 MYAPPITGQI ICSSNITGLL LTRDGGNSTE TETEIFRPGG GDMRDNWRSE

451 LYKYKVVRIE PIGVAPTRAK RRTCQGGIDG ILQISSSQGA GGjagA VGIGA

501 VFLGFLGAAG STMGARSVTL TVQARLLLSG IVQQQNNLLR AIEAQQHMLQ

551 LTVWGIKQLQ ARVLALERYL RDQQLMGIWG CSGKLICTTS VPWNVSWSNK

601 SVDDIWNNMT WMELEREIDN YTDYIYDLLE KSQTQQEKNE KELLELDKWA

651 SLWKLV

SEQ ID NO:9 Four Amino acid peptide linker sequence GILI

SEQ ID NO:10 Fifteen Amino acid peptide linker sequence GGIDGILQISSSGQA

SEQ ID NO:11 Twenty-six Amino acid peptide linker sequence

GGIDGILQISGSGSGGSGQGSSSGGA

SEQ ID NO:12 Fifteen Amino acid peptide linker sequence

(GGSGG)3

SEQ ID NO:13 Twenty Amino acid peptide linker sequence

(GGSGG)4

SEQ ID NO:14 Twenty five Amino acid peptide linker sequence

(GGSGG)5
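
The repeat notation used for SEQ ID NOs:12 to 14 can be expanded into explicit residue strings to confirm the stated lengths. The sketch below is illustrative only; the helper name `expand_repeat` is hypothetical, and it assumes the notation always has the form of a parenthesised unit followed by a repeat count.

```python
import re


def expand_repeat(notation: str) -> str:
    """Expand notation such as '(GGSGG)3' into the full residue string."""
    match = re.fullmatch(r"\(([A-Z]+)\)(\d+)", notation)
    if match is None:
        raise ValueError(f"unrecognised repeat notation: {notation!r}")
    unit, count = match.group(1), int(match.group(2))
    return unit * count


# Expand the three repeat linkers and report their lengths.
linkers = {n: expand_repeat(f"(GGSGG){n}") for n in (3, 4, 5)}
for n, sequence in linkers.items():
    print(n, len(sequence), sequence)
```

With a five-residue unit, three, four and five repeats give 15, 20 and 25 residues, matching the lengths named in the three linker descriptions.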
